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The level crossing and inverse statistics analysis of DAX and oil price time series are given. 
We determine the average frequency of positive-slope crossings, v+i where T a = is the 

average waiting time for observing the level a again. We estimate the probability P(K,a), 
which provides us the probability of observing K times of the level a with positive slope, in 
time scale T a . For analyzed time series we found that maximum K is about ~ 6. We show 
that by using the level crossing analysis one can estimate how the DAX and oil time series will 
develop. We carry out same analysis for the increments of DAX and oil price log- returns, (which 
is known as inverse statistics) and provide the distribution of waiting times to observe some 
level for the increments. 
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1 Introduction 



Stochastic processes occur in many natural and man-made phenomena, ranging from various 
indicators of economic activities in the stock market, velocity fluctuations in turbulent flows 
and heartbeat dynamics, etc pQ. The level crossing analysis of stochastic processes has been 
introduced by (Rice, 1944, 1945) [2-26], and used to describe the turbulence [16J, rough surfaces 
[27] . stock markets [28], Burgers turbulence and Kardar-Parisi-Zhang equation [291 ED] ■ The 
level crossing analysis of the data set has the advantage that it gives important global properties 
of the time series and do not need the scaling feature. The almost of the methods in time series 
analysis are using the scaling features of time series, and their applications are restricted to the 
time series with scaling properties. Our goal with the level crossing analysis is to characterize 
the statistical properties of the data set with the hope to better understand the underlying 
stochastic dynamics and provide a possible tool to estimate its dynamics. The level crossing 
and inverse statistics analysis can be viewed as the complementary method to the other well- 
known methods such as, detrended fluctuation analysis (DFA) [31], detrended moving average 
(DMA) [32J, wavelet transform modulus maxima (WTMM) [33J, rescaled range analysis (R/S) 
[34J, scaled windowed variance (SWV) |35j, Langevin dynamics [36], detrended cross-correlation 
analysis [37], multifactor analysis of multiscaling [38J, etc. 

We start with formalism of the level crossing analysis. Consider a time series of length n 
given by x(ti),x(t 2 ), ...,x(t n ) (here x(ti) is the log-return of DAX and oil prices). The log- 
return x{ti) is defined as x{ti) = m(yj/yj_i), where i/i is the price at time t,. Let denote 
the averaged number of positive slope crossing of x(t) = a in time scale T = nAt with At = 1 
(we set also the average < x > to be zero). The averaged can be written as N+(T) = v^T, 
where v£ is the average frequency of positive slope crossing of the level a. The positive level 
crossing has specific importance that it gives the next average time scale that the price yi 
will be greater than the again up to specific level. For narrow band processes it has been 
shown that the frequency z/+ can be deduced from the underlying joint probability distributions 
function (PDF) for x and dx/dt = x. Rice proved that [2] 

poo 

v tt = / xp{x = a 1 x)dx 1 (1) 
Jo 

where p(x, x) is the joint PDF of x and x. For discrete time series (of course all of real data are 
discrete) the frequency z/+ can be written in terms of joint cumulative probability distribution, 
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P{xi > a,Xi-i < a) as [39], 

2/+ = P(%i > a,Xi„i < a) 

/a roc 
/ p(xi,Xi-i)dx i dx i - 1 , (2) 
-oo J a 

where p(xi, x^i) is the joint PDF of Xi and The inverse of frequency v£ gives the average 
time scale T a that one should wait to observe the given level a again. 

The rest of this paper is organized as follows: Section 2 is devoted to summary of level 
cross analyzing of DAX and daily oil price log-returns. The inverse statistics of DAX and Oil 
price time series are given in section 3. Section 4 closes with a discussion and conclusion of the 
present results. 

2 level crossing 

Here, at first we provide the results of level crossing analysis for two normalized log-return time 
series, daily German stock market index (the DAX) and daily oil price. The daily fluctuations 
in the oil price and DAX time series were belong to the period 1998-2009. We also study 
the asymmetric properties of level crossing analysis for positive and negative level crossing of 
the time series. To have a comparison we provide also level crossing analysis of synthesized 
uncorrelated noise. Also we will provide the results of level crossing analysis of high frequency 
data for DAX with sample rate 4 (1/min), where we have used 2511000 data points and belongs 
to the period 1994-2003. 

Figure 1 shows the frequency z/+ for daily log-returns of DAX and synthesized uncorrelated 
noise. The PDF of synthesized uncorrelated noise is Gaussian and it has white noise nature 
(i.e. its correlation has delta-function behavior). As shown in figure 1, their level crossing 
frequency are almost similar near to a ~ and have deviation for levels in the tails. The 
difference is related to the non-Gaussian PDF of DAX log- return time series (see below). For 
the normalized Gaussian uncorrelated noise one can show that the frequency u£ is given by 

^ = \[l-erf 2 (a/s/2)\, (3) 

where erf(U) is the error function. In figure 1 a comparison between the analytical and 
numerical results for uncorrelated Gaussian time series is given. For Gaussian uncorrelated 
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data the frequency z/+ behaves as: 

~ - exp(— a 2 /27r) for a — > 

v£ — - exp(— 4a 2 /7r) /or a — > ±00, (4) 

while we have found that z/+ for daily DAX and oil price time series have power- law tails l/|a|^ 
with exponents (3dax = 3.5 ±0.1 and (3 oi [ = 3.8 ± 0.2, respectively (see inset of figure 1). It 
means that the DAX and oil price time series have non-Gaussian tails for their level crossing. 
We note that within the error bars, the exponent (3d ax — Pou- However the exponent may 
depends on the sample rate of data acquisition (see below). The exponent (3 can be estimated 
using the method proposed in [5T| |52| [53] . If the PDF or z/+ follow a power law with exponent 
(3 = k + 1, one can estimate the power-law exponent k by sorting the normalized returns or 
levels by their sizes, a.\ > a 2 > ... > a/v> with the result n = (N — l)^^ 1 In where 
(N - 1) is the number of tail data points. 

In general, there are two reasons to have non-Gaussian tails for level crossing of given time 
series; (i) due to the fatness of the probability density function (PDF) of the time series, in 
comparison to a Gaussian PDF. By definition a fat PDF is defined via the behavior of its tails. 
If its tail goes to the zero slower than a Gaussian PDF then we call it as fat tail PDF. In this 
case, non-Gaussian tails cannot be changed by shuffling the series, because the correlations in 
the data set are affected by the shuffling, while the PDF of the series is invariant, (ii) due 
to the long-range correlation in time series. In this case, the data may have a PDF with 
finite moments, e.g., a Gaussian distribution. The easiest way to distinguish whether the PDF 
shape or long-range correlation is responsible for the fastnesses of v£ f° r the DAX and oil 
log-returns time series, is by analyzing the corresponding shuffled and surrogate time series. 
The level crossing analysis will be sensitive to correlation when the time series is shuffled and 
to probability density functions (PDF) with fat tails when the time series is surrogated. The 
long range correlations are destroyed by the shuffling procedure and in the surrogate method 
the phase of the discrete Fourier transform coefficients of time series are replaced with a set of 
pseudo-independent distributed uniform (—77, +n) quantities. The correlations in the surrogate 
series do not change, but the probability function changes to Gaussian distribution [4"0" t |4"T | W2\. 

In figure 1 the level crossing frequency of shuffled and surrogated DAX time series are given. 
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The figure shows that the frequency z/+ has more difference for original and surrogated time 
series and means that the non-Gaussian tails for i/+ due to the fatness of PDF is dominant [16J. 
We have found similar results for daily oil price log-returns. 

Now let us introduce the PDF, P(K,a), which provides us the probability of observing K 
times of the level a with positive slope, in the averaged time scale T a . By construction the 
average < K >, i.e. < K > \ a = J2k=o KP(K, a) will be unity and P(K,a) satisfies the 
normalization condition Y,k=o P{K-> oi) = 1. In principle we can assume that the upper bound, 
i.e. iV to be infinity. For the processes and levels that satisfy P(0,a) « J2k=i P(K> a )> 
we expect to have a good estimation about the future of process. This means that one will 
observe the level a with high probability in time scale T a at least once. For the levels that the 
PDF P(K,a) satisfies Y% =1 P{K,a) « P(0,a), the process will be not predictable. Figure 2 
shows the PDF P(K, a), for daily DAX and oil price log- return time series for different levels 
a. We plotted also P(K, a) for some levels of synthesized Gaussian uncorrelated data to have 
a comparison. As shown in figure 2 the upper bound N is about 6, which means that it is 
almost impossible to observe same level a in average time scale T a more than 6 times, even for 
the white noise. We used 10 7 data points for white noise synthesized data and found that the 
maximum number of observing is also about 6. 

To find the best interval for the estimation of time series future, we consider the variation 
of the PDF, P(0, a) with respect to the level a. This will enable us to find the range and 
intervals of levels that one can estimate the future of these time series with high accuracy. In 
figure 3 the PDFs P(K = 0, a) for daily DAX, oil and uncorrelated synthesized time series, 
are given. It shows that the P{K = 0, a) of daily DAX and oil time series for the interval 
—0.5 < a < 0.5 has smaller probability with respect to uncorrelated time series. It means that 
with high probability (with respect to white noise), one can observe the level a at least once 
in time scale T a for as belong to this interval. The typical time scale T a for these interval is 
about 4 days for the daily DAX and oil time series, respectively. The corresponding time scales 
T a for different as are shown in figure 4. For the daily DAX and oil price time series (for the 
interval 2 > a > —2) we found the following empirical curve fittings: 

T a (DAX) = 4.10- 0.18a + 4.90a 2 + 0.21a 3 + 0.94a 4 , 
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T a (oil) 



3.90 - 0.37a + 5.70a 2 + 0.57a 3 + 1.69a 4 



(5) 



where T Q 's have the dimension "days". To check the applicability level crossing for forecast of 
the time series, in figure 5 we plotted the daily time series of DAX and indicates the points 
with level a = with red points. The average T a for this level indicated by vertical lines. We 
expect that in this time scale one should observe another red points with high probability. We 
also investigated the asymmetry properties of time series with respect to positive and negative 
slops. 

Finally, we have done similar analysis to the high frequency DAX time series and find that 
the best interval to estimate the future is —0.01 < a < 0.01. The typical time scale belong to 
this levels is about 75sec and the obtained exponent /3 was 2.4 ± 0.1. For this time series, the 
averaged time scale T a depends on the level a as: 



where T a has dimension in seconds. 

3 Inverse statistics 

To the modeling the statistical properties of financial time series Simonsen, et al. [13] asked 
the "inverse" question: what is the smallest time interval needed for an asset to cross a fixed 
return level 7? or what is the typical time span needed to generate a fluctuation or a movement 
in the price of a given size [42-46]? The inverse statistics is the distribution of waiting times 
needed to achieve a predefined level of return obtained from every time series. This distribution 
typically goes through a maximum at a time so called the optimal investment horizon, which 
is the most likely waiting time for obtaining a given return [48J. Let y(t) be the price at time 
t. The logarithmic return calculated over the interval At is, r& t (t) = ln(y(t + At)) — ln(y(t)). 
Given a fixed log-returned barrier, 7, of an index, the corresponding time span is estimated for 
which the log-return of index for the first time reaches the level 7. This can also be called the 
first passage time through the level 7 for r^t- in figure 6, we plotted the probability distribution 
p(r) of normalized waiting time r needed to reach return levels 7 = 0, la, 2a for daily oil, daily 
DAX log-returns and integrated white noise data (i.e. fractional Brownian motion ffim). 



T a (DAX) = 72 - 3a + 186a 2 + a 3 - 4a 4 , 



(6) 
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As figures 6 show, for the zero level for r^t, inverse statistics of the two markets does not 
deviate from fractional Brownian motion while they are rather different behavior from fBm for 
7 = lcr and 2a. We fit the waiting times distribution functions p(r) for different level 7 via the 
Weibull distribution function [54J: 

P(r,T) = |(^- 1 exp[-(^] (7) 

Where 5 is the stretched exponent (or shape parameter) and T is the characteristic time scale. 
We found 5 and T for fBm and Oil and DAX time series and summarized results in Table 1. 

TABLE I. The stretched exponent 5 and characteristic time scale T fitted by Weibull 

distribution for various time series. 
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timeseries 


5 


T 





fBm 


0.423 


1.849 





Oil 


0.347 


1.132 





DAX 


0.312 


5.791 


1 


fBm 


0.704 


13.145 


1 


Oil 


0.496 


7.943 


1 


DAX 


0.600 


6.836 


2 


fBm 


0.956 


34.178 


2 


Oil 


0.878 


32.480 


2 


DAX 


0.988 


18.631 



4 Conclusion 

In summary, we analyzed the DAX and oil daily price log-return time series using the level 
crossing method and find the average waiting time T a for observing the level a again. This 
is a similar analysis as what has been done in Refs. jlHl [50J, [51]. They have been carried out 
the level crossing of the volatility time series, instead of the time series itself. We define and 
estimate the probability of observing K times of the level a, P(K,a) in time scale T a . We 
show that by using the level crossing analysis one can estimate the future of the daily DAX 
and oil time series with good precision for the levels in the interval —0.5 < a < 0.5. Also, using 



7 



the inverse statistics we estimate the waiting time probability distribution for two financial 
markets, i.e. oil and DAX time series. 
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Captions 

FIG. I. The level crossing analysis of the DAX log-returns (original, shuffled and surrogated) 
and uncorrelated Gaussian time series. Inset: the log- log plot of level crossing frequency vs 
level a for DAX log-return time series. The Gaussian uncorrelated time series has exponential 
tails (~ exp(— 4a 2 )), while the daily DAX time series has power-low tails with exponent ~ 3.5. 

FIG. 2. The PDf P(K, a) vs K for different levels a for normalized log-returns time series 
of the daily German stock market index (DAX) (top), oil daily price (bottom) and uncorrelated 
synthesized data. 

FIG. 3. The PDF P(K = 0, a) vs a for DAX, oil daily price and uncorrelated synthesized 
normalized log-return time series. The inset is same figure with wide range of a's. The results 
for uncorrelated synthesized data are plotted to have a comparison. 

FIG. 4. The level dependence of average time T a for daily DAX and oil price log-return 
time series. 

FIG. 5. The points (red) with level a = for daily DAX time series. We expect that in 
this time scale one should observe another red points with high probability. 

FIG. 6. The probability distribution p(r) of normalized waiting time r needed to reach 
return levels at scale r, i.e. 7 = 0, 7 = la and 7 = 2a for two financial markets including oil 
and DAX time series. Solid curves are the fitted curve (Weibul distribution) based on Eq. (7). 
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Figure 1: 
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